A Survey on Cross-Lingual Summarization

نویسندگان

چکیده

Abstract Cross-lingual summarization is the task of generating a summary in one language (e.g., English) for given document(s) different Chinese). Under globalization background, this has attracted increasing attention computational linguistics community. Nevertheless, there still remains lack comprehensive review task. Therefore, we present first systematic critical on datasets, approaches, and challenges field. Specifically, carefully organize existing datasets approaches according to construction methods solution paradigms, respectively. For each type dataset or approach, thoroughly introduce summarize previous efforts further compare them with other provide deeper analyses. In end, also discuss promising directions offer our thoughts facilitate future research. This survey both beginners experts cross-lingual summarization, hope it will serve as starting point well source new ideas researchers engineers interested area.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A survey on Automatic Text Summarization

Text summarization endeavors to produce a summary version of a text, while maintaining the original ideas. The textual content on the web, in particular, is growing at an exponential rate. The ability to decipher through such massive amount of data, in order to extract the useful information, is a major undertaking and requires an automatic mechanism to aid with the extant repository of informa...

متن کامل

A survey of cross-lingual embedding models

Cross-lingual embedding models allow us to project words from different languages into a shared embedding space. This allows us to apply models trained on languages with a lot of data, e.g. English to low-resource languages. In the following, we will survey models that seek to learn cross-lingual embeddings. We will discuss them based on the type of approach and the nature of parallel data that...

متن کامل

A Survey on Multi-Document Summarization

Multi-document summarization aims at delivering the majority of information content from multiple documents using much less lengthy texts, usually a short paragraph of several hundred words. This paper surveys several different approaches to multi-document summarization by first building a unified high level view of the multi-document summarization problem, and then comparing different approach...

متن کامل

Evaluation of Text Summarization in a Cross-lingual Information Retrieval Framework

We report on research in multi-document summarization and on evaluation of summarization in the framework of cross-lingual information retrieval. This work was carried out during a summer workshop on Language Engineering held at Johns Hopkins University by a team of nine researchers from seven universities. The goals of the research were as follows: (1) to develop a toolkit for evaluation of si...

متن کامل

Complex Cross-lingual Question Answering as a Sequential Classification and Multi-Document Summarization Task

In this paper, we describe the JAVELIN IV system, which treats complex question answering as a sequential classification and multi-document summarization task. Our research and development effort is based on various forms of linguistic annotation, and a comparison of various answer extraction and summarization algorithms. We discuss the use of different units of extraction, the effect of differ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Transactions of the Association for Computational Linguistics

سال: 2022

ISSN: ['2307-387X']

DOI: https://doi.org/10.1162/tacl_a_00520